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RNA polymerase (RNAP) is molecular machine that polymerizes a RNA molecule, a linear het¬ 
eropolymer, using a single stranded DNA (ssDNA) as the corresponding template; the sequence of 
monomers of the RNA is dictated by that of monomers on the ssDNA template. While polymerizing 
a RNA, the RNAP walks step-by-step on the ssDNA template in a specific direction. Thus, a RNAP 
can be regarded also as a molecular motor and the sites of start and stop of its walk on the DNA 
mark the two ends of the genetic message that it transcribes into RNA. Interference of transcrip¬ 
tion of two overlapping genes is believed to regulate the levels of their expression, i.e., the overall 
rate of the corresponding RNA synthesis, through suppressive effect of one on the other. Here we 
model this process as a mixed traffic of two groups of RNAP motors that are characterized by two 
distinct pairs of start and stop sites. Each group polymerizes identical copies of a RNA while the 
RNAs polymerized by the two groups are different. These models, which may also be viewed as two 
interfering totally asymmetric simple exclusion processes, account for all modes of transcriptional 
interference in spite of their extreme simplicity. A combination of mean-field theory and computer 
simulation of these models demonstrate the physical origin of the switch-like regulation of the two 
interfering genes in both co-directional and contra-directional trafhc of the two groups of RNAP 
motors. 


I. INTRODUCTION 

Synthesis of messenger RNA, a linear heteropolymer, 
using the corresponding template DNA, is called tran¬ 
scription; it is carried out by a molecular machine called 
RNA polymerase (RNAP) [l|. This machine also ex¬ 
ploits the DNA template as a filamentous track for its 
motor-like movement consuming input chemical energy 
0. Polymerization of each RNA by a RNAP takes place 
in three stages: (a) initiation at a specific ‘start’ site 
(also called initiation site) on the template, (b) step-by- 
step elongation of the RNA, by one nucleotide in each 
forward step of the RNAP motor, and (c) termination 
at a specific ‘stop’ site (also called termination site) on 
the template. For the sake of convenience, throughout 
this paper we refer to the segment of the template DNA 
between the start and the stop sites as a ‘gene’. 

RNAPs moves from 3' to the 5' direction on a single 
strand of DNA. Often multiple RNAPs polymerize the 
same gene simultaneously. In such RNAP traffic all 
the RNAPs engaged simultaneously in the transcription 
process move in the same direction while polymerizing 
identical copies of a RNA, all by initiating transcription 
from the same start site and terminating at the same 
stop site. Since any segment of the template DNA cov¬ 
ered by one RNAP is not accessible simultaneously to any 
other RNAP, this steric exclusion gives rise to nontrivial 
spatio-temporal organization of RNAPs in RNAP traf¬ 
fic. Theoretical models of this kinetic process have been 
developed over the last few years [1, |^, [23 - 1^ by appro- 
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prlately adaptin^and extending, asymmetric exclusion 
process (ASEP) Q , a popular model in nonequilib¬ 
rium statistical mechanics that we describe briefly in the 
next section 

In this communication we report theoretical studies of 
more complex RNAP-trafhc phenomena that are believed 
to play important regulatory roles in living cells iini. 
These phenomena arise from simultaneous transcription 
of two overlapping genes either on the same DNA tem¬ 
plate or two genes on the two adjacent single strands 
of a duplex (double-stranded) DNA. In the former case, 
traffic is entirely uni-directional although RNAPs tran¬ 
scribing different genes polymerize two distinct species 
of RNA molecules by starting (and stopping) at differ¬ 
ent sites on the same template DNA strand. In con¬ 
trast, in the latter case, RNAP traffic in the two adja¬ 
cent “lanes” move in opposite directions transcribing the 
respective distinct genes. In both these situations the 
phomenon of suppressive influence of one transcriptional 
process on the other is called transcriptional interference 
(TI) [Hill. 

In general, a RNAP at the initiation, elongation or 
termination stage of one transcriptional process can sup¬ 
press the initiation, or elongation (or termination) of the 
other transcription by another RNAP US]. In other 
words, the stages of transcription of the two interfereing 
RNAPs define a distinct mode of interference. Different 
modes of interference have been assigned different names 
like “occlusion”, “collision”, “sitting duck interference”, 
etc. [l3| . Many pairs of interfering transcription pro¬ 
cesses are known to form a bistable switch: switching 
ON a high level of transcription of one of the two genes 
can switch OFF the other by its suppressive effect [§-H0l| . 
The main aim of this communication is to demonstrate 
this effect using a unified theoretical framework that we 
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develop here. This framework is capable of throwing light 
on many other kinetic aspects of TI phenomena. 

In this paper we develop a unified theoretical frame¬ 
work that, for a given relative orientation of two genes, 
captures all possible modes of TI by a single set of mas¬ 
ter equations. Solving these equations, we investigate the 
effects of TI on the rates of the two transcriptions. More¬ 
over, by carrying out extensive computer simulations of 
our model, we also test the validity of the mean-field ap¬ 
proximations made in formulating the master equations. 
We explore the effects of the spatial extent and rela¬ 
tive orientation of the overlap of the two genes as well 
as those of the kinetic parameters like initiation rates. 
Our results demonstrate interesting regulatory phenom¬ 
ena arising from TI. In particular, the transcription of 
one gene can be practically switched off by increasing 
that of another to a sufficiently high level. 

II. MODEL 




FIG. 1: Schematic representation of TI model, (a) Codirec- 
tional mixed traffic with separate ramps: transcription of two 
overlapping genes, (b) Contradirectional traffic on adjoin¬ 
ing unequal tracks: sense and anti-sense transcription of two 
overlapping genes 

We represent a single-stranded DNA (ssDNA) by a lin¬ 
ear chain (i.e., a one-dimensional lattice) of equispaced 
sites that are labelled by the integer index i. Each site 
of this lattice denotes a nucleotide which is a monomeric 
subunit of the DNA track. The chain serves both as the 
template and track for the respective RNAP motors that 
are engaged in its transcription. We also represent each 
RNAP by a hard rod, i.e., an extended particle, of length 
I in the units of nucleotide length, i.e., it covers £ suc¬ 
cessive sites of the lattice simultaneously. Normally, i is 
typically 30 to 35 nucleotides. We denote the position of 
a RNAP on its track by the lattice site at which the left¬ 
most unit of the rod is located while the next i—l sites of 
the lattice are merely covered by the RNAP. Thus, if the 
lattice site j denotes the position of a RNAP then the 


RNAP covers not only the site j but also the next £ — 1 
sites j -£ 2, ...,j -\- £ — 1. The RNAPs interact with 
each other with only hard core repulsion that is captured 
by imposing the condition that no lattice site is allowed 
to be covered by more than one RNAP simultaneously. 

We model both codirectional TI (TTI) and contradi¬ 
rectional TI (CTI) using suitably extended TASEPs. For 
modeling interference of the expression of two genes we 
incorporate two interfering TASEPs each characterized 
by the respective distinct pair of start and stop sites and 
two distinct species of RNAP motors. For modeling TTI 
only a ssDNA template needs to be treated as the track 
for both group of RNAP motors that transcribe the two 
distinct genes simultaneously. But, for modeling simulta¬ 
neous polymerization of sense and anti-sense transcripts, 
which are encoded on the two distinct strands of a du¬ 
plex DNA, we use two antiparallel lattices on which the 
contra-directional traffic of the two groups of motors take 
place. 

We identify two sites separated by h nucleotides as the 
start sites for the two genes; h is an integer that can 
be positive, negative or zero (see figH]). A fresh initia¬ 
tion of transcription of a gene, however, is not possible 
as long as the first £ sites, starting from the start site of 
that gene, remain fully or partly covered by any other 
RNAP, irrespective of the identity of the gene that is 
being transcribed by the latter. We denote the rates 
of initiation of transcription of the two genes by oi, 02 , 
respectively. Whenever £ successive sites, starting from 
the start site of a gene on the DNA template is vacant, 
a fresh RNAP is allowed to cover those £ sites thereby 
initiating the corresponding transcription. Each RNAP 
carries an unique label 1 or 2 depending on which of the 
two genes it is engaed in transcribing; the label is as¬ 
signed to it depending on the start site from where it 
begins its walk on its track. Irrespective of the actual 
numerical value of £, each RNAP can move forward by 
only one site in each step, provided the target site is not 
already covered by any other RNAP. Single-site stepping 
rule is motivated by the fact that a RNAP must tran¬ 
scribe the successive nucleotides one by one. A RNAP 
engaged in the transcription of one of the two genes can 
detach from the lattice only after it reaches the stop site 
of the corresponding gene. However, so far as the rates of 
termination are concerned, we assume the corresponding 
rates to be Pi = £^2 = P- AI and L2 denote the lengths of 
the two genes, measured in terms of the number of lattice 
sites from the start to the stop sites of the corresponding 
gene. The interference between the transcription of these 
two genes takes place in the region of their overlap. 

In the case of TTI no RNAP can pass the other imme¬ 
diately in front of it irrespective of which genes are being 
transcribed by the two RNAPs. This is motivated by the 
fact that in case of TTI both the genes are encoded on 
the same ssDNA strand. In contrast, in the case of CTI, 
when two RNAPs labelled by two distinct integer indices 
1 and 2 (i.e., transcribing different genes) face each other 
head-on, these are allowed to pass, albeit with a hop- 
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ping rate that is lower than that in the absence of the 
obstruction, i.e., qi < Qi and q 2 < Q 2 - This prescrip¬ 
tion is motivated by the fact that the genes for the sense 
and antisense transcripts are encoded on the two distinct 
complementary strands of the duplex DNA which serve 
as the tracks for the corresponding oppositely moving 
RNAP traffic. However, in the same model of CTI if 
a RNAP finds itself just behind another co-directional 
RNAP in front (i.e., both transcribe the same gene and, 
therefore, carry the same integer label) then the trailing 
RNAP remains stalled till the site in front of it is vacated 
by the leading RNAP. The motivation for this prescrip¬ 
tion is that the two RNAPs engaged in transcribing the 
same gene move on the same ssDNA strand. 

A. Master equations under mean-fleld 
approximation 

Let t) denote the probability that at time t there 
is a RNAP at site i engaged in the transcription of the 
gene /i (/r = 1,2 for the genes 1 and 2, respectively). 
Note that the probability that the site i is occupied by a 
RANP, irrespective of the gene it is transcribing, is given 

by P{i) = 


1. Co-directional traffic 

Let P{i\j) be the conditional probability that, given 
a RNAP at site i, there is another RNAP at site 
j located downstream along the lattice. Obviously, 
C(i|j) = 1 ~ P{i\j) is the conditional probability that, 
given a RNAP at site i, site j is empty. Therefore, by 
definition, 

1 - E P(i + s) 

^ (il* + i) = -- 

1 - p{i + s) + p{z + e) 

S=1 

Let ^{j) be the probability that site j is not covered by 
any RNAP, irrespective of the state of occupation of any 
other site, is given by 1 — X]s=o ~ s). Note that, if 
site i is given to be occupied by one RNAP, the site i — 1 
can be covered by another RNAP if, and only if, the site 
i — i is also occupied. 

Under mean-field approximation, the master equations 
governing the stochastic kinetics of the two interfereing 
transcriptional processes are given by (for h > 0) 


dPiil,t) 

dt 

dPi{i,t) 

dt 

dPi{Ll,t) 

dt 

dP2il h,t) 
dt 


dP2{i,t) 

dt 

dP2{h + L2,t) 
dt 




QPiji - l,t)^ (z - QPi{i,t)^{i\itj for, (1 < i < LI) , 

QPi(Ll - l,t)£ (Ll-l |Ll - 1 -h£) -/3Pi(Ll,t), 

a2 - E (i-E P{h + — QP2{X + /i, t) ^ (1 -h h \l h f), 


s=l 


= 1 for (h=0) 

QP 2 (i — 1, — l \i — I i) — QP 2 {ijt)^{i\i for, {1h < i < hL2), 

QP2{h + L2 - 1, t)C( hPL2-l \h PL2-1P£)- pP2{h + L2, t). 


(1) 


r 


Equations for h < 0 can be obtained from m by inter¬ 
changing index 1 and 2. 


2. Contra-directional traffic 


gene ^{p = 1,2) at time t, at lattice site i. Obviously, 
Ci(il* + ^) is the conditional probability that, given a 
RNAP on gene 1 at site i, site i-\-1 is empty. Therefore, 
by definition. 


Effect of CTI on gene expression can be studied by 
writing down and solving master equation for P^{i,t), 
which denotes the probability of finding an RNAP on 
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1- E Pi{i + s) 

6(11* + ^) = -^- 

1- E Pi{i + s) + Piii + e) 

S=1 

Similarly, ^ 2 {* ~ ^|i) is the conditional probability that, 
given a RNAP on gene 2 at site i , site i — £ is empty. 
Therefore, by definition, 


Let CiO) be the probability that site j on gene 1 is not 
covered by any RNAP, irrespective of the state of occupa¬ 
tion of any other site, obviously, ^r (j) = 1 — Ef=o P^ (j ~ 
s). Similarly, ^ 2 (j), the probability that site j on gene 2 is 
not covered by any RNAP, irrespective of the state of oc¬ 
cupation of any other site, is given by 1 — Ef=o P'^^^ ~ '®)- 


6(*-^li) = 


1 - E P 2 {^ - s) 

S = 1 


1- E P 2 {i - s) + P^ii - i) 

-s=i- 


Under mean-field approximation, the master equations 
are written as 


dPi (l,t) 

dt 

dPi {i,t) 
dt 

dPi (Ll,t) 
dt 

dP2 (1 -I- h,t) 
dt 


dP2 {i,t) 
dt 

dP2 {2 + h - L2,t) 
dt 


= ai 


1 - ^ Pi (s, t) 


l-^P2(s,t) 


-^^r(lU)er(l|l + £) [Qi6(l +e) + 9r{l - 6(1 + £)}], 


= 1 for (h—L2>^) 

Pi(j - l,t)$i (i - l \i -1 + £) [Qi6(* - 1 + 6 + 9i{l - 6(* - 1 + ^)}] 

Pi{ht)ii{i\i + £)[Qi£. 2 {i + £)+qi{^-i 2 {i + £)}] for, (l<i<Ll), 

Pi (PI - 1, U6 (PI-I IPI -1 + 6 [QiUL^ - 1 + 6 + 9i{i - 6(ii - 1 + e)}] - pPi (PI, t ), 


= 0.2 


1 - y^-Pi (1+ 


s=0 


1 — ^ P 2 (1 + ^ — S, t) 


s=0 


= 1 for (h—Ll>^) 

- P2(l + /i,t)6(l + h- £\l±h) [Q2^i{h) + g2{l - ^i{h )}], 

= P2(j + i,t)6(* + 1 - £\ i + 1) [<326(6 + 92{i - 6(6}] 

- P2(666(* - ^16 [Q26(* - 1) +'?2{i - 6(* - 1)}] for, (2 + h - P2 < z < i +/i) , 

= P2(3 + h - P2,66(3 + h-L2- £\ 3 + h-L2) [626(2 + h - L2) + q2{l - ^i{2 + h - P2)}] 

- pP2{2 + h- L2,t). 


( 2 ) 


III. RESULTS 


Solving the master equations CD and ([2]) numerically 
under steady state conditions we obtained the corre¬ 
sponding rates of the transcriptions of the two genes. 
Moreover, in order to test the range of validity of the 
MFA made in writing the master equations, we also car¬ 
ried out extensive direct computer simulations of our 
model using the same set of parameter values that we 
used for solving the master equations. During the simu¬ 
lations, we monitored the flux of the RNAPs. The system 
needed, typically, about two million time steps to attain 


the steady state after which we collected the steady-state 
data over the next five million time steps. The steady- 
state properties presented in this paper are averages of 
the data collected only in the steady state of the system. 
All the numerical results plotted in this paper have been 
obtained for £ = 10, Pi = 1000 and P 2 = 1100; by com¬ 
paring with the results for a few other lengths of RNAPs 
and genes, we ensured that our conclusions do not suffer 
from any artefacts of the choice of these parameters. 
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FIG. 2: Codirectional: The switch like behavior of fluxes 
of RNAPs, plotted as a function of oi, for Q = 30 
02 = 5 s“^, Pi = P 2 = 10005“^ and /i = 20 fep.The dashed 
line and solid line corresponds to our mean-field theoretic 
predictions for flux 1 and flux 2 respectively whereas the 
discrete data points (cross and star corresponds for flux 1 
and flux 2 respectively) have been obtained from computer 
simulations.The insets show the average density profiles for 
three different values of oi .When the gene does not have a 
transcriptional interference, its expression follows different ki¬ 
netics.Corresponding flux is plotted as a function of 02 , for 
Q = 30 3“*^ (ai = 0).The solid orange line corresponds to our 
mean-field theoretic predictions and violet data points have 
been obtained from computer simulations. 



FIG. 3: Codirectional: The roles of the two genes as “sup¬ 
pressor” and “suppressed’ are interchanged as h is varied from 
-N to -l-N (N=40 in this figure just for illustration). 


A. Results for co-directional traffic 

In figlU we plot the fluxes of RNAPs in the two traf¬ 
fic which, as explained above, are the overall rates of 
transcription of the two genes. First of all, note that 
for any given value of 0 : 2 , gene 2 would have got tran¬ 
scribed normally at a fairly high rate if the gene 1 were 
not interfering with its transcription. As long as ai is 
not too high, the rate of transcription of gene 2 is weakly 
affected by infrequent co-directional “collisions” and pro¬ 
ceeds at a fairly high rate. But, as ai increases the time 


gap detected at any arbitrary site between the departure 
of a RNAP and the arrival of the next RNAP becomes 
shorter. Therefore, the site for the initiation of tran¬ 
scription of gene 2, which is located on the path of the 
RNAP traffic on gene 1, remains “occluded” for most of 
the time if ai is sufficiently high. Consequently, a high 
rate of expression of gene 1 strongly suppresses the ex¬ 
pression of gene 2, irrespective of the actual numerical 
value of a 2 - Thus, the rates of transcription of the two 
genes are strongly anti-correlated. 

The role of “suppressor” and suppressed” genes are in¬ 
terchanged as the separation h between the transcription 
initiation sites is varied from a positive integer to a nega¬ 
tive integer (see figO]) The sharp changes take place only 
over a narrow interval of the order of ^ around h = 0. 


B. Results for contradirectional traffic 

Although the results on flux, plotted in figlU are 
qualitatively similar to the corresponding results for co¬ 
directional traffic plotted in figUl there are some addi¬ 
tional features. The kinks observed in the density pro¬ 
files shown in the insets of figH] are consequences of the 
extended “defect” created by the slower moving RNAPs 
against the faster moving ones M- 



FIG. 4: Contradirectional: Same as in figO except that the 
traffic is contradirectional and h = 500, Q\ — Q 2 ~ 30 s“^, 
qi = q2 = 10 02 = 1.0 s“\ Pi = P2 = 1.0s-\ LI = 

L2 = 1000. 


IV. SUMMARY AND CONCLUSION 

Co-directional and contra-directional two-species 
TASEP, both on a single track and two parallel tracks, 
have been studied earlier for purely theoretical consid¬ 
eration as well as for capturing real physical processes 
[isl - fl^ . But, in all those models a single pair of start 
and stop sites serve as the entry and exit points for both 
species of particles. Motivated by cytoskeletal motor 
traffic, TASEP-based models with two distinct species 
of oppositely moving self-propelled particles have been 
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developed earlier [Tol - f^ . Since those motors can attach 
to and detach from any lattice site, except for quantita¬ 
tive difference in the values of attachment/detachment, 
the motor kinetics at the two ends of the track were 
qualitatively no different from those at any other site. 
In contrast, for the transcription of a specific gene, 
RNAP motors have to start and stop their walk at 
pre-designated sites. Moreover, premature detachment 
of a RNAP, which would produce a truncated RNA 
strand, is not allowed in our model because such errors 
occur very rarely. 

In most of the earlier theoretical models on RNAP traf¬ 
fic [ 234 ^ all the RNAP were engaged in transcribing a 
single gene; therefore the traffic was uni-directional and 
every RNAP polymerized identical copies of the RNA 
while a single pair of start-stop sites marked the points of 
initiation and termination of transcription. In contrast, 
in the models reported in this paper two distinct pairs of 
start-stop sites mark the points of initiation and termina¬ 
tion of the respective genes. Moreover, the RNA species 
that gets polymerized by a RNAP depends on the sites 
from which it initiates transcription. Thus copies of two 
distinct species of RNA get synthesized simultaneously 
by a mixed population of two groups of RNAP motors 
the relative direction of whose movements is dictated by 
the relative orientation of the two genes. 

In our model of TI in co-directional mixed RNAP traf¬ 
fic we assumed that a RNAP passively waits at its current 
position if the target nucleotide in front is already cov¬ 


ered by another RNAP. The temporarily stalled RNAP 
can resume its forward movement, and the concomitant 
transcriptional activity, only after its target site is va¬ 
cated by the RNAP immediately in front of it. We have 
also ignored the possibility of backtracking of the indi¬ 
vidual RNAP motors [l^, [S, [13 ■ In future extensions of 
our model [ll| , we intend to explore the effects of back¬ 
tracking, active re-starting of stalled RNAP by a trail¬ 
ing RNAP [13,113 as well as premature detachment of 
RNAPs upon suffering collision. 

In addition to the differences in the models developed 
here and all other TASEP-type models described above, 
the main questions addressed in those models are also 
fundamentally different from those addressed in this pa¬ 
per. In all the earlier theoretical works the effects of the 
different modes of interference have been studied sepa¬ 
rately m, 113 . The simple TASEP-based unified model 
that we have developed here not only captures all pos¬ 
sible modes of transcriptional interference, but also ac¬ 
counts for the self-regulation of the pair of genes through 
the adaptation of the levels of their interfering transcrip¬ 
tions. 
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